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HISTORY  AND  INTRODUCTION 

Tests  of  significance  in  multivariate  analysis  refers  to  various  methods 
of  testing  hypotheses  in  which  samples  are  drawn  from  one  or  several  multi* 
variate  populations*  The  most  important  of  such  methods  is  the  analogue  to 
analysis  of  variance  for  a  single  variate.  This  technique  has  been  referred 
to  as  analysis  of  dispersion*  It  concerns  partitioning  sums  of  squares  and 
cross  products  of  a  set  of  samples  from  several  multivariate  populations* 
The  null  hypothesis  that  particular  mean  vectors  of  these  populations  are 
equal,  given  that  the  populations  have  equal  covariance  matrices,  is  tested* 

Criteria  to  test  for  equality  of  covariance  matrices  have  been  devel- 
oped* In  addition,  there  have  been  other  criteria  for  testing  hypotheses 
about  such  matrices  formulated*  A  criterion  to  test  if  a  subset  of  the 
variables  brings  out  further  differences  in  populations  when  differences 
resulting  from  the  rest  of  the  variables  are  removed  has  been  proposed,  and 
the  associated  method  has  been  called  internal  analysis*  Finally,  two 
useful  tests  have  been  derived  to  test  hypotheses  about  a  single  population. 
The  equality  of  means;  of  variancas  and  covariances;  or  of  means,  variances, 
and  covariances  of  the  variates  can  be  tested*  Also,  a  criterion  has  been 
devised  to  test  the  mutual  independence  of  subsets  of  the  variates. 

In  these  tests  the  problem  of  finding  percentage  points  is  difficult 
because  the  exact  distribution  is  unknown  or  too  complicated.  Approximations 
have  been  derived  from  the  moments  of  these  criteria  by  two  methods  of  ap- 
proach. First,  the  moments  have  been  used  to  fit  a  Pearson- type  curve* 
This  usually  gives  a  good  approximation  but  also  involves  tedious  and  com- 
plicated computations*  Second,  the  moment  expression  has  been  used  to  obtain 


A  chl-«quartt  approximation  to  a  function  of  tha  logarltha  of  the  statlstle. 

Baglnninga  In  tha  thaory  of  multlTarlata  analyala  data  back  to  1917 
whan  R.  A.  Flahar  found  tha  aaapling  dlatributlon  of  tha  alaaanta  of  tha 
eovarlanca  natrlz  for  a  blvarlata  normal  population.  Tha  extraaion  to  tha 
caaa  of  multiple  variataa  occurred  in  1928  when  Wiahart  found  the  diatri- 
bution  bearing  hia  name.  In  1931,  Hotelling  found  the  diatribution  of  a 
atatiatic  ^,  a  natural  multivariate  extenaion  of  the  univariate  caaa  of 
Student' a  ^  diatribution.  Next,  a  multivariate  analogue  to  the  variance 
ratio  teat  waa  developed.  Wilka  (1932),  uaing  the  likelihood  ratio  method 
that  had  recently  been  propoaed  by  Neyman  and  Pearaon,  obtained  diatribution 
generaliaationa  of  many  univariate  atatiatica  including  that  of  tha  analyaia 
of  variance  teat  criterion.  Thia  work  formed  a  baaia  for  many  developmenta 
in  multivariate  hypotheaia  teating. 

Generally  apaaking  the  difficultiea  found  in  hypotheaia  teating  were 
not  in  the  derivation  of  a  criterion,  but  in  finding  ita  exact  diatribution 
when  the  null  hypotheaia  vaa  true  and  in  evaluating  percentage  pointa  for 
practical  uaa.  Thua,  for  the  next  twenty-five  year a  there  were  numeroua 
derivationa  of  the  diatributiona  of  the  criteria  and  of  good  and  workable 
approximationa  to  them. 

Thia  report  will  be  devoted  to  a  diacuaaioa  of  the  topica  Mntioned 
above. 

VILK*S  1932  GONTRXBUTIOM 

Wilka  (1932)  defined  the  generalised  variance  of  a  aample  of  k  it 
from  a  p-variata  normal  population  to  be  the  p  order  determinant  |a^.| 
i^ere 
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and  X.   is  the  Yalua  of  th«  i   varlate  for  the  a   item  In  the  sanple, 
Wllka  then  found  the  general  moaent  and  the  distribution  of  js.  |.  He 

proceeded  to  derive  the  general  moment  of  the  distribution  of  the  ratio  of 
two  generalized  sample  variances.  Both  of  these  distributions  were  in 
integral  form  which  he  succeeded  in  evaluating  only  for  p  ■■  1  and  p  ■  2. 

In  1931  Pearson  and  Neyoan  had  proposed  a  maximum  likelihood  criterion 
X_  for  testing  the  hypothesis  that  n'  samples  are  drawn  from  a  subclass 

w  of  a  class  Q  of  admissible  populations,  where  Q  is  the  class  of  all 
sets  of  n'  univariate  normal  populations  and  u  is  the  subclass  in  each 
set  for  which  the  n*  populations  have  the  same  means  and  variances.  The 
maximum  of  the  likelihood  function  of  all  n*  populations  of  class  (2  is 

where  c  is  a  constant  depending  on  the  n  's,  the  number  of  individuals  in 

D 

the  3   sample,  s.   is  the  variance  of  the  &   sample,  s    is  the  pooled 
p  o 

n 

variance  of  all  n*  samples,  and  N  <■  ^  n..  The  maximum  in  u  is 
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and  the  likelihood  ratio  is 
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Wllka  found  the  generalization  of  this  to  be 


where  |8..«|  is  the  generalized  variance  of  the  B   sample  and  \^j^a^\ 

la  the  generalized  variance  of  the  sample  formed  by  coari>lning  all  n* 

samples.  Note  that  the  b'^  sample  consists  of  n  p-variate  observations. 
To  test  the  hypothesis  H(p)*  that  «'  is  the  class  of  all  sets  of 
p-variate  normal  populations  in  which  the  covarlance  matrices  are  equal, 
irrespective  of  the  means,  Wilks  used  the  criterion 


^H(p)' 
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Wilks  also  derived  the  general  moments  of  these  criteria,  and  evaluated  the 
distribution  functions  for  some  special  cases.  These  are  as  follows t 


Variance  Ratio      Degrees  of  freedoa 
n'-2,  for  any  p  "^  •  ^=f^  P  and  (N-p-1) 

n»53,  for  any  p  ^  •  ^=^  2p  and  2(N-p-2) 

p-1,  for  any  n  ^  *  ^        ^"'"^^  '"**  ^""'^ 

-.'■"■'  '       ''i 

p-2,  for  any  n  ^  •  ^i=?^       2(n'-l)  and  2(N-n»-l) 

The  quantity  X  denotes  either  of  the  criteria. 

ANALYSIS  OF  DISPERSION 

In  univariate  analysis  of  variance,  tests  of  significance  reduce  to 
coaparlng  two  Independent  nean  squares.  One  of  these  Is  an  estimate  of  the 
variance  to  which  a  single  observation  Is  subject  and  can  usually  be  referred 
to  as  error  variance.  The  other  Is  such  an  estlnate  only  when  the  null 
hypothesis  Is  true,  and  nay  be  called  the  mean  square  due  to  deviation  from 
hypothesis.  In  the  multivariate  case,  If  each  observation  consists  of  p 
varlates,  then  there  are  p  total  sums  of  squares  and  1/2  (p)(p->l)  stnas  of 
cross  products.  Analysis  of  dispersion  consists  of  analyzing  each  of  these 
Into  Independent  categories.  The  dispersion  due  to  any  category  gives  a 
generalized  variance.  As  In  the  univariate  case  an  error  ten  Is  available, 
which  consists  of  estimates  of  the  variances  and  covarlances  of  a  set  of 
variables.  The  corresponding  matrix  due  to  any  other  category  leads  to  such 
estimates  only  when  the  null  hypothesis  Is  true.  Thus,  as  In  the  univariate 
case.  It  Is  possible  to  compare  Independent  "mean  squares'*  In  analysis  of 
dispersion. 


Many  ptibllcatlons  have  appeared  In  statistical  literature  on  the  analysis 
of  dispersion  for  particular  experinental  layouts  and  designs.  Presently,  only 
the  general  linear  hypothesis  will  be  considered,  since  the  theory  and  meth- 
odology can  readily  be  BK>dified  for  a  particular  case. 

Let  n  be  the  number  of  samples  T  (i"l,  2,  .  .  .  ,  n),  where  Y   is 
the  (pxl)  vector  of  the  observations  on  the  p  correlated  variables,  such 
that  the  Y  '  are  normally  and  independently  distributed  with  mean  vector 
II.'  and  covarlance  matrix  I   .  Thus,  if  Y  is  the  (nxl)  vector  of  the  Y^', 

Y  is  a  (nxp)  observation  matrix.  The  Joint  distribution  of  the  Y   would  be 


(2n)2   |I|2  I 

Let  the  model  be  denoted  Y  ■•  A^  •»■  e.  Then  A  is  the  (nxm)  design  matrix 
whose  elements  describe  the  design  under  which  the  data  are  obtained  and  which 
has  rank  r  ^  m  <  n.  If  A  is  singular,  it  must  be  constrained  so  that  it 
has  an  inverse,  or  so  that  ^  is  linearly  estimable.  The  effects  of  the 
factors  involved  in  the  experiment  are  denoted  as  ^,  an  (mxp)  matrix  of  unknown 
parameters.  The  rows  of  the  matrix  c  are  assumed  to  be  normally  and  inde-* 
pendently  distributed  with  mean  Z  and  covarlance  matrix  ^,  where  Z  la 
a  null  matrix.  Also,  p  <^  (n-r) ,  so  that  the  sample  error  matrix  is  positive 
definite  almost  everywhere. 

The  null  hypothesis  to  be  tested  is  that  C^  •■  Z  where  C  is  an(sxm) 
matrix  of  rank  s<,r<m<n,  which  operates  on  4  to  give  differences  in 
corresponding  parameters  which  are  being  tested  for  equality.  It  is  usual  to 
make  the  first  row  of  C  a  vector  of  units,  but  this  is  unnecessary.  The 


alternative  hypothesis  to  be  considered  is  that  at  least  one  of  the  equalities 
in  the  systen  Ci^  •  Z  does  not  hold. 

By  the  operations  admissible  in  matrix  algebra,  the  quadratic  form  of  the 
likelihood  function  may  be  manipulated  as  follows: 


"  T-1 


(tr)  Sum  (Y/  -  w/)  r  (Y.  -  V.)  - 

i-1    ^  y-^ ., 


;»"-   ■"-"»,  , 


■/      » 


1  " 
tr  r  Sum  (Y  -  mJ(Y.'  -  y.*) 

i-1 


tr  r^(Y'  -  ♦•a'XY  -  A*)  - 

tr  r^  [Y»Y  -  Y»A(A»A)''^  A'Y  +  (J'  -  ♦')A'A(*  -  ♦)]  - 

tr  r^  [E  +  (♦'  -  ♦•)  A'A(J  -  ♦)] 

where  E  -  Y*Y  -  Y*A(A'A)'^  A*Y  and  tr  denotes  the  trace  of  the  matrix.  In 
the  first  equation  the  matrix  has  dimension  (1x1)  and  thus  taking  the  trace 
does  not  alter  the  expression.  In  general,  L  will  be  maximised  when 

^  ♦  -  (A'A)"^  A*Y.  Thus,  in  fi 


np    ^ 

(2n)2  |X|2 


exp 


H'-r^-'] 


It  can  be  shown  that  \     is  maximised  by  ^  <■  -*  E.  Thus, 
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n 
2 

(2n)2  |E|2 


[-H  ■ 


Under  the  null  hypothesis  the  quadratic  form  can  be  partitioned  as 
follows  I 

tr  r^  (Y«  -  ♦'A')a  -  A4)  - 

tr  r^  F(Y»Y  -  Y»A(AU)"^  A»Y)  +  (Y»A(A»A)'^  C»  [C(A»A)'^  C']"^  C   . 

(A'A)"^  A'Y)  +  (♦•  -  ♦•)A'A(J  -  ♦)  ^  - 

tr  r^   [E  +  H  +  (♦•  -  ♦•)A'A(J  -  4)3 
where 

i  -  (A'A)"^  A'Y  -  (A»A)"^  C"^  [C(A«A)"^  Cf^  C(A'A)"^  A'Y 
-  (A'A)"^  (I  -  C"^  [C(A'A)**^  C']"^  C(A'A)"^]  A»Y 
and  H  -  Y»A(A»A)'^  C' (CCA'A)"^  C'j'^  C<A'A)'^  A'Y  . 

A 
4k  * 

In  k>  »  letting  4  '  ^»  and  ^  "  ~  (E-l-E),  the  maxloum  likelihood  estimates. 
It  Is  found  that 


L(«)   - 

A 
2 

n 

BL           E 

exp 

(2n)^     h+b|^ 

Therefore , 

[-7  ^^]    • 


Mr 

-  2  In  X  -  -  n  In 

■  |H+E| 

It  should  be  noted  that  E  Is  the  matrix  due  to  error  and  H  is  the 
matrix  due  to  the  hypothesis.  Also,  H  is  symmetric  and  at  least  positive 
semidefinite.  Thus»  the  criterion  can  take  on  valuea  only  on  the  unit 
interval.  A  necessary  and  sufficient  condition  for  a  criterion  to  be  unity 
is  that  the  null  hypothesis  be  identically  true.  When  the  null  hypothesis 
is  "accepted",  the  average  value  of  the  criteria  will  be  less  than  one  under 
repeated  applicatira,  but  it  will  be  larger  than  t^en  the  hypothesis  is 
rejected. 

A  basic  theorem  has  been  proven  by  Wilks  (1938).  It  states) 

If  a  population  with  a  variate  x  is  distributed  according  to 
the  probability  function  f(x,8. 6.  ),  such  that  optimum  estimates 

e  of  the  6   exist  which  are  distributed  in  large  samples  according 

to 

1        - 


h£  .,.Li  \ 


(2n)2 


T       ^     *»J  *        -^ 


where 


^i  "  ^h  -  e^)  -^  . 
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||c. .||  is  potltiYS  definite,  E  denotes  nathematical  expectation, 

and  ^  is  of  order  l/i^,  then  when  the  hypothesis  that  ^4  "  ^  4» 

i"vfl,  ar»-2,«.,,h  is  true,  the  distribution  of  -2  In  L,  where  L  is 
a  likelihood  ratio  criterion  is,  except  for  terns  of  order  l/«^, 
distributed  like  a  chi'square  with  Qx-^a)   degrees  of  freedom. 

The  degrees  of  freedom  is  the  difference  in  the  nunber  of  paraaeters  used  to 

aaxlnlze  the  likelihood  function  in  Q     and  u  *  This  was  proven  by  showing 

that  the  characteristic  function  of  -2  In  L  is  identical  to  that  of  the  chi- 

square  distribution.  Thus 

-  n  In  (-ijSl-   r\j  c[p(k-l)l 
where  c(b)  denotes  the  chl-square  density  function  with  n  degrees  of  freedom. 

where  q  is  one  less  than  the  number  of  populations  k. 

Rao  (1948)  obtained  the  asymptotic  distribution  of  V^  in  the  form 


-  ^2'  <%,+♦  -'»,'!■    *  •  •  • 


Where     P    ^  is  the  density  function  of  chi-sqiiare 
with  pt{+9  degrees  of  freedom,  and 


m  -  n  -i  (p+q+1), 
T,  -If  (pV-5), 


'2       48 
and 


4  "  -f"    •*•  1920     t3p*  +  3q*  +  lOp^q^  -  50   (p'  +  q*)  +  159]  . 


Oth«r  values  of  tbe  y^^  "Ay  be  calculated  If  a  tiore  exact  probability  is 

desired.  The  first  tern  consists  of  the  approxiaation  suggested  by  Bartlett, 

2 
mad  the  second  ten  is  of  order  1/b  . 

Rao  (1951)  obtained  a  closer  approximation  by  considering  that  if  Y  is 

the  s   root  of  A,  where 

then  the  distribution  function  of  Y  can  be  written  as 

-  »{^*  e,  r  W  +  .  .  . 
where 


e--M=^. 


^ !!*! (r-lH5r>7)  ] 

(  16r(r+l)(r+2)(r+3)        5760    J  » 


^        Tit)       (  16r(r+l)(r+2)(r+3) 

and  6(t,u)  is  the  distribution  function  of  the  beta.  The  first  ten  offers 

a  powerful  approximation,  the  second  being  of  order  1/m  .  Also,  Rao  has 

shown  that  . 

1  -  X*   .   as't-2e 
1         2r 

.s 


u 


can  be  used  as  a  variance  ratio  with  2r  and  (ns-f26)  degrees  of  freedom* 
The  quantity  (ms-t-26)  need  not  be  integral.  The  exact  distribution  functions 
given  by  Wilks  (1932)  are  also  applicable  in  analysis  of  dispersion.  They 
can  be  used  by  letting  q"n*  -  1  and  n«"N-l. 

Other  work  in  this  area  has  been  done  by  Kabe  (1962) »  Katti  (1961),  and 
Boy  (1951).  Kabe  shows  that  the  distribution  of  several  test  criteria  which 
can  be  expressed  in  terns  of  gamaa  functions  is  identical  with  the  distribution 
of  a  linear  function  of  gamma  variates.  Katti  shows  that  random  orthogonal 
transformations  can  be  tised  to  show  that  the  likelihood  ratio  is  distributed 
as  the  product  of  independent  beta  variables.  Koy  derived  the  series  expansion 
of  the  distribution  of  a  number  of  likelihood  criteria,  and  demonstrated  the 
accuracy  of  using  the  first  term  as  an  approximation. 

INTEBNAL  ANALYSIS 

Let  X.,  •  •  •  »  '  »  ^a^i*  *  *  *  >  ^B4ti  ^^  ^'^  dependent  variables  for 

which  samples  of  else  n.,  •  •  •  ,  n.   are  taken  from  k  normal  populations. 
It  is  possible  to  test  if  the  variables  x  .,,...  ,  x_.   bring  out  further 
differences  in  populations  when  differences  due  to  x. ,  .  .  .  ,  x   are  re- 

X  8 

moved.  This  was  done  by  Rao  (1948)  and  is  accomplished  by  following ^technique 

developed  by  Wishart.  Let  (T  )  -  (H  )  +  (E  ),  i,j-l,  2,  .  .  .  ,  »fp 

be  the  analysis  of  dispersion  for  the  »fp  variates  with  degrees  of  freedom 

a  "  q  -f  (n  -  q).  The  sum  of  products  matrix  due  to  error  for  the  variables 
x^,  .  .  .  ,  x^  is  to  be  eliminated.  Partition  the  matrix  (E.  .)  as  follows t 


ij' 


^11*       hi" 


hi*  hi* 
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wh«r«  E..*  la  (sxs).  Than  Che  sum  of  products  matrix  due  to  error  for 
x,»  •  •  •  »  »,  i«  given  by 

E(p/8)  •  E22*  -  E2j*(E^^*r^  Bj^2*  . 

Similarly,  T(s/p)  can  be  found  and  the  criterion  is  the  ratio  of  the  former 
to  the  latter.  The  associated  degrees  of  freedom  are  (n  -q-s)  and  (n  -s). 
The  approxlnatlons  in  internal  analysis  are  the  sane  as  those  in  an- 
alysis of  dispersion  except  for  adjustments  in  the  number  of  variates,  since 
internal  analysis  is  essentially  a  conditional  analysis  of  dispersion.  Again 
this  procedure  can  be  extended  to  any  partitioning  of  the  variances  and 
cover lances. 

TESTS  OF  EQUALITY  IN  A  SINGLE  POPULATION 

Using  the  method  of  naxlnum  likelihood,  Wilks  (1946)  developed  test 

criteria  L™-,  L^  ,  and  L^  for  testing  the  following  hypotheses  about  the 

■■'■'     '•,,.■•■■'■■'.■ 
parameters  of  a  single  p-variatc  normal  popularioni 

H_.^:  The  means  are  equal  and  the  variances  and  covariances 
are  equal; 

lU  t       The  variances  and  covariances  are  equal,  irrespective  of 
the  means; 

H^t    The  means  are  equal,  given  that  the  variances  and  covariances 
are  equal. 

The  sample  criteria  turn  out  to  be  the  following: 
SiVC  "  He  Hi    • 

NC       p     p-1 

(s^)  (1-r)    (l+(p-l)r) 


and 


s. '^^ 


(•ha-T)  +-ij  I  a^-i) 


p-^i-i 


respectlvsly. 


when 


•'-H.«. 


PA 


*  ■  P  i,  *i' 


1-1 

and  a.,  and  x.  ara  aa  pravloualy  daflnad.  Thua,  thara  ara  k  p^varlata 

obaarvatione.  Tha  ranga  of  aach  erltarlon  la  tha  unit  Interval. 

To  find  thaaa  crltarla  Wllka  uaad  tha  aaxlBum  llkallhood  aathod,  ob- 
taining a  llkallhood  ratio  \      by  appropriate  oparatlona.  The  teat  criteria 
ware  the  2/k'^  root  of  X^^   and  \^^,   and  the  2/k(p-l)"'  root  of  X^, 

denoted  by  I^q»  I^^»  and  L^.  To  uaa  the  three  criteria  their  dlatrlbutlon 
under  the  null  hypotheala  had  to  be  found.  Wllka  did  thla  for  p"2  and  p"3 
for  I^^  and  L^^,  and  generally  for  I^.  The  dlatrlbutlon  functions  ware 
found  from  the  general  monants  by  fomulatlng  functions  alallar  to  nonent 
generating  functions  and  evaluating  particular  derivatives.  Since  the 
criteria  can  have  valuaa  between  aero  and  one,  the  moaents  detemlned  tha 
dlatrlbutlon  uniquely.  For  L^,  when  H^  la  true,  the  distribution  la 
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r  fi  k(p-i)l        i  (k-i)  (p-i)-i      7(p-i)-i 

dF(L^) ^ ^ 1^^  (1  -  I^)^  ^ 

r[f  (k-i)(p-i)]  r[i(p-i)]    ^  ^ 

For  p*>2,  the  saapllng  distribution  functions  of  L^  and  L_^  can  be  found  to 

^(k-4) 


«»^<H(VC>"I<^-2)(L^^)2      dl^g 


r[i(k-i)]      1         _i 


n^  r  ^  (k-2) 


and  for  p*3f 


These  distribution  functions  are  valid  only  if  the  respective  hypotheses  are 
true.  It  should  be  noted  that  to  test  H^,  an  F-ratio  can  also  be  used*  This 
ia 


•l 


(k-l)(p-l)(l-L„)  ^   , 

(p-l)(k-l) 


where 
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8  -  k  I     (x  -  x)  , 
^     i-1 

-2-  ^,  }j\a'K'h''^^^' 


0-1  i-1 


and 


1  *^ 

■^  a-1 

This  is  equivalent  to  an  analysis  of  variance  to  test  "row"  effects  in  a 
(pxk)  layout  where  rows  are  associated  with  the  p  variates  and  coluoms  with 
the  k  iteas  in  the  sanple. 

Other  criteria  to  test  special  hypotheses  about  a  single  p-variate 
nomal  satsple  have  been  given  by  Bhapkar  (19S9),  Fertlg  and  Leery  (1936),  and 
Mauchly  (1940).  Bhapkar  gives  a  criterion  to  test  that  all  covariances  are 
zero.  The  criterion  of  Fertig  and  Leary  may  be  used  to  test  that  the  means, 
variances »  and  covariances  have  specified  values.  Mauchly 's  test  criterion  is 
used  to  test  that  the  variances  are  equal  and  the  covariances  are  all  zero. 

Varaa  (1951)  developed  methods  to  find  the  exact  distributions  of 
L_.-  and  L^.  in  the  form  of  infinite  series,  which  are  numerically  workable 

in  calculating  significance  to  any  level  of  accuracy.  Beginning  with  the 

moments  and  using  various  complex  transformations,  he  showed  that 

It,  e  ^  1 


"<Src'       2r((k-3)l  *S<f 

"'Hnrc'         4r(k-3)  "Sivc^ 
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By  using  Sterlings'  asynptotlc  expansion  for  r(x+h)  and  a  factorial  series 
suggested  by  Nalr,  these  can  be  evaluated.  The  one  and  five  percent  prob- 
ability levels  for  the  criteria  are  given  for  p-4,5,6,7  and  various  values  of 

k. 

The  preceedlng  theorea  by  Wllks  (1938)  Is  also  applicable  to  these 

criteria.  Thus, 

-k  In  L^j.  ^  C[|  p(p+3)-31 
-k  In  Lyg   r^  C[j  p(p+l)-21 

-k  In  Ljj   r\->    C[p-ll  . 

Tukey  and  Wllks  (1946)  derived  approximate  distributions  for  L^^  and 
Ly  In  the  form  of  Pearson's  Type  I  distribution.  Approximate  percentage 
points  can  be  obtained  by  using  tables  of  the  incomplete  beta  function. 
They  Justified  this  by  showing  that  the  distribution  of  a  product  of  beta 
variables  can  be  approximated  by  a  single  beta  variable, 

i  tt-1       "i  v-1 
(l')    (1  -  l') 


e(u,v) 

where  L  denotes  any  one  of  the  criteria,  v  and  r  depend  on  p,  and  u  depends 

also  on  k.  Some  special  cases  are  as  follows t 

p            criterion  v  « 

2                 L^,  i(k-2)  I 

2  I^c  h^'^>  2 

3  W  ^'^  ^ 
3                I^g2  k-3  2 

P                 Ljj  i  (k-l)(p-l)  I  (p-1)    . 


r»-"-- 
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Pearson's  tables  of  the  Inconplete  Beta  Functions  are  entered  with  v    and 

tt.  If  Thompson's  tables  are  employed,  2v  and  2u  are  used.  Tukey  and  Wilka 

have  tabled  other  values  of  v  and  u  for  particular  values  of  p  and  r. 

INDEPENDENCE  OF  SUBSETS  OF  VARIABLES 

Vllks  (1935)  developed  a  criterion  to  test  the  independence  of  groups 
of  variates.  If  k  individuals  are  measured  on  p  variables  with  a 
multivariate  normal  distribution,  and  if  these  variables  are  grouped  into  t 
groups,  the  m   group  having  n  of  the  variates,  the  hypothesis  to  be  tested 
is  that  the  t  sets  are  mutually  independent.  The  criterion  is 

l'„l        Ir<,l 


»i-T 


"'      '  " 


A  '•«-'   Jli  ''«-' 


where  r. .  is  the  sample  correlation  coefficient  of  all  individuals  between 

the  i^^  and  j'^  variables,  |s..  |  is  the  determinant  of  the  covarianee  matrix 

Ijm 

of  the  m^^  set  of  variables,  and  |r..  |  is  the  determinant  of  the  correlations 
within  the  m'  group.  Again,  the  values  of  this  criterion  lie  on  the  unit 
interval.  Wilks  fotmd  the  distribution  of  this  criterion  for  special  groupings 

into  two  and  three  subsets.  In  the  case  of  division  into  t%fo  groups,  one 

2 
of  one  variable,  the  other  of  p-1,  X^  reduces  to  1-R  where  R  is  the  multiple 

correlation  coefficient. 

Wald  and  Brookner  (19A1)  have  shown  that  the  distribution  of  the  criterion 

can  be  obtained  in  a  beta  series,  and  the  distribution  of  the  logarithmic 

statistic  in  a  gamma  series.  Wilks  has  proven  that  k(l-X-)  is  distributed 

as  chi-square  with  y  degrees  of  freedom,  where 
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Pistil  .  I    ^V^> 
2      ail    2 

if  k  is  large  coapared  with  p.  Further,  the  probability  of  obtaining  a 
value  of  X-  leaa  than  an  observed  value,  say  X,  is 


Pr{Xji  X}  -  1  -  ll 


.^) 

1 

ru(v 

-^1)' 

e-^x^dx 

where 

^  <»»'^>  "  r(W   o^ 

la  the  incoaplete  gaama  function. 

TESTS  W   COVARIANCE  HATRICE8 

In  addition  to  Wilks'  1932  criterion  to  test  the  equality  of  covariance 
aatrices,  Plackett  (1947)  has  developed  a  test  which  is  essentially  the  sane 
as  the  analysis  of  variance  procedures  developed  by  Fisher.  To  test  the 
equality  of  a  set  of  paraiiaters  is  equivalent  to  testing  whether  (n-1) 
orthogonal  linear  functions  of  the  paraaeters  each  vanish.  If  p"l  and  a 
typical  observation  from  the  ■   distribution  is  t  (a  -  1,  2,  .  .  .  ,  n), 
then  n  mutually  orthogonal  linear  functions  can  be  formed,  one  being 

u  ■  t,  +  t,  +  .  .  .  +  t  . 
12  n 

If  the  (n~l)  covarlances  of  u  and  all  of  the  other  linear  functions  are  tero, 
then  the  variances  of  the  n  distributions  must  be  equal.  Thiis,  the  problem 
becomes  one  of  testing  the  independence  of  two  groups  of  variates,  one  of  p 
equations  and  one  of  the  p(n-l)  other  orthogonal  linear  functions. 

If  the  6    population  has  sample  covariance  matrix  a. ..(3*1,2,.. .,0),  the 
hypothesis  to  be  tested  is  that  a^^o^a. .  ,3l'e"*  1,2,  .  •  •  ,  n.  The 
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observatiooa  nay  be  written  In  a  natrix  of  dlnenslons  (lump).  Thus  the  1 
obeervatlons  on  the  S   population  will  appear  In  eoluflm  (l~l)n-f-0.  Let  the 
observation  In  the  a   saaple  In  this  column  be  denoted  as  x.   .  The  order 
of  the  elements  In  a  eoluoa  is  assumed  to  be  random;  if  this  is  a  faulty 
assumption >  it  is  necessary  to  rearranga  segments  of  the  rows  of  the  ob- 
servation matrix  since  the  criterion  depends  on  a  random  arrangement* 
Plackett  proves  the  following  theorem! 

2p 

W(n,p) E^r 

S(n,p)  S  (n,p) 

is  distributed  like  Wilks*  statistic  for  testing  the  hypothesis  that 
two  groups  of  variates  of  sizes  p  and  p(n-l),  known  to  have  been 
drawn  from  a  np-varlate  normal  population,  are  mutually  independent* 
If  the  groups  are  in  fact  mutually  Independent,  then  s. ..  *•  »ja^» 

The  symbol  S(n,p}  denotes  the  sum  of  all  n  ^  signed  minors  formed  by  rows 

6.,  S,*****^  '^^   columns  c.,  e,,..*,c  of  a  matrix  G  «  H*H  where  H  is  the 

(kxnp)  matrix  in  which  an  element  is 

u  8     B   1   T  ,  P 
lo    lo    k  ^j  lo 

In  application,  this  criterion  is  restricted  to  equal  sample  size  k  when 
k  >  np  * 

To  test  for  homoscedastlclty  in  the  univariate  case.  Box  (1949)  extends 

the  criteria  that  Bartlett  (1937)  proposed  as  a  test  statistic. 


M  -  N  In  S  -  I     V^  In  S-, 

0-1  ^     ^ 

where 

k 


"  8-1  ^  ^      0-1  ^ 


n 


aad  •.  is  the  unbiased  Mtlnuitc  of  tha  variance  In  the  6   group  baaed  tm 
auaa  of  aquares  having  v.  degrees  of  freedom.  Suppose  s...  Is  the  un- 
biased  estimate  of  the  variance  or  eovarlance  between  the  1   and  j 
variables  In  the  $   sample  based  on  sums  of  squarea  having  v.  degrees  of 
freedom.  If  there  are  k  such  samples  and 

then  Bartlett*s  criterion  can  be  generalised  aa 

-  -  »  In  Lj^ 


where 


^   8-1  \  I  'ij 


I  ^  N 


I 

Box  (1949)  has  provided  several  approximations  to  this  criterion.  By  finding 
the  general  moment  of  a  function  of  thia  statistic;  the  characterlatlc  function 
of  c*M,  where  c*  la  a  conatant  less  than  or  equal  to  one;  and  ualng  an 
Inversion  theorem,  the  probability  density  function  of  c'M  was  found  as  an 
Infinite  series  which  can  be  approximated  by  the  chl-square  distribution.  ; 
Box  found  that  If  p  >  1  \< 

fr  '^    Cli(n-l)p(p+l)l, 
where 
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If  the  degraea  of  freedom  are  equal  this  becomes 


J: 


1       (2p^  ♦  3p-l)(fc»l) 
t»"^ 6(p+l)kVp       • 

This  approxlaatlon  Is  Inadequate  when  p  Is  large  and  v^  Is  saall* 
Box  has  also  shown  that 


vhere 


and 


7^^      ro       FCfj.fj), 


^  ■  1  -  A^'i  f,/f,  «^i>s' 

■  1  ■  A,'^  2/f,  «A,<A^2^ 


^    6(k-i)(p*i)   [  gf^vg    m;     ; 


0 


'l  -  2  P<^^^^"^^» 


f.-^*' 


^  2,  • 


■     K-t 


1^   -   VI 


■;nrT  ^ 


«i'i 
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Xhia  •pproxlnatlon  is  easily  coaputcd  with  unequal  sample  sizes. 

George  (1945)  finds  the  exact  distribution  for  the  criterion  developed 
by  Wilks  to  test  for  homogeneity  of  variance  for  special  cases  of  equal  and 
unequal  sample  aizes.  Nair  (1938)  has  given  a  formal  solution  to  the  dis- 
tribution of  the  2/n   root  of  L.  and  for  the  case  of  samples  of  equal  sizes 
the  exact  expression  has  been  derived  in  certain  instances.  Bishop  (1939) 
has  developed  an  empirical  method  of  fitting  a  Type  I  curve.  He  also  obtained 
an  approximation  for  the  general  case  in  terms  of  the  chi-square  distri- 
bution. Gnanadesikan  (1960)  gives  methods  to  test  equality  of  more  than  two 
dispersion  matrices  against  certain  alternatives.  A  method  analagous  to  least 
squares  has  been  used  to  derive  the  criteria* 

DISCUSSION  ;  ;'  .#'-•,  ...        ^-^ 

The  methods  of  multivariate  analyses  are  useful  for  many  reasons.  First, 
in  BUkny  cases  with  little  additional  cost  to  an  experimenter,  more  information 
can  be  obtained  on  each  individual  item.  Then,  if  additional  measurements 
provide  no  additional  information,  as  can  be  tested,  the  loss  would  be  slight 
as  compared  to  the  conceivable  possible  gain  for  purposes  of  prediction  or 
control.  Second,  in  many  disciplines  experiments  which  are  multivariate  in 
nature  are  being  performed,  but  univariate  methods  of  analysis  are  being  ap- 
plied by  assuming  repeated  measurements  to  be  independent.  Not  only  are 
basic  assumptions  being  violated,  but  the  total  information  in  the  sample  is 
not  being  utilized,  and  faulty  concltisions  can  result.  Third,  with  the 
availability  of  high  speed  computers,  the  cost  and  difficulty  of  calculation 
are  minimized.  Further,  tiiM  is  no  longer  a  major  factor  in  the  choice  of  a 
method  of  analysis. 
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Applications  of  the  above  aethods  are  unlimited.  Kendall  (1961)  gives 
examples  in  many  areas,  illustrating  many  of  tbe  different  methoda.  In  fact, 
every  experimental  discipline  has  use  for  multivariate  analysis. 
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Th«  derivation  and  use  of  criteria  to  test  hypotheses  about  the 
paraaaters  In  a  mvltlTarlate  population  vara  glvan  lapatas  by  tha  work  of 
S.  S«  Wllks.  One  of  tha  most  significant  results  was  analysis  of  dispersion. 

In  analysis  of  dispersion  tha  criterion  Is  used  to  test  the  hypothesis 
that  tha  aaan  yactors  of  n  populatlona  are  equal.  It  la  aasumcd  that  the 
covarlance  aatrlces  are  equal;  criteria  to  test  this  assumption  are  given. 
A  general  procedure  of  analysis  of  dispersion  Is  outlined  under  the  general 
linear  nodal  and  the  assuaptlon  of  normality.  Internal  analysis,  which  Is 
a  conditional  analysis  of  dlaperslon,  is  also  discussed  briefly. 

Various  tests  of  equality  of  the  parameters  In  a  single  population  are 
presented.  Also,  the  criterion  to  test  for  Independence  of  subsets  of  the 
varlates  In  a  single  population  Is  given. 

Various  approximations  to  the  distributions  of  all  the  above  criteria 
are  given.  Thoae  approximations  that  are  superior  because  of  ease  of  com- 
putation or  because  of  goodness  of  fit  are  enumerated  more  fully. 
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